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1 

Title: In Vivo Recombination 

Field of the invention 

The present invention relates to a method for in vivo 
5 recombination of homologous DNA sequences. The method is a 

forced artificial evolution resulting in a DNA sequence encoding 
a polypeptide having an advantageous property. 

back ground of the invention 

10 Homologous recombination between DNA sequences placed on 

the same plasmid is well known to the skilled artisan (Weber and 
Weissmann, Nucl. Acid. Res. (1983) .11, 5661-5669). EP 252666 Bl 
(Novo Nordisk A/S) , W095/22625 Al (Affymax Technologies N.V.) 
and WO9101087 discloses in vivo recombination of genes placed on 

15 the same plasmid. 

EP 449923 Bl (Setratech) discloses a process of inter- 
genie recombination in vivo of partially homologous DNA se- 
quences. The recombination takes place in cells of which the en- 
zymatic mismatch repair system is defective. 

20 J. Biotechnol. (1991) 19, 221-240 discloses a process 

for inactivating a gene in the genome of B . Amyloliquefaciens. 
The process involves integration and excision of a pE194 mutant. 

In Chemical Abstracts 118:161925 (1993) the minimal se- 
quence length of homology necessary for recombination is deter- 

25 mined. The method of determination involves integration of a 
pE194 derivative carrying a gluconase gene. 

In J. Bacterid. (1982) 152 , 524-526 a plasmid pBD9, 
which comprises the two plasmids from Staphylococcus aureus, 
pE194 and pUBHO, is disclosed. Other plasmids based on pE194 or 

30 pUBHO are disclosed in Molecular and General Genetics (1988) , 
213, 465-70, Molecular and General Genetics (1984), 195, 374-7, 
Plasmid (1981), 6, 67-77 and Genetika, (1986) 22, 2750-7. 

In Yichuan Xuebao (1993), 20, 272-8 (see Chemical Ab- 
stracts 120:1670 (1994)), Chen et. Al. disclose the plasmid 

35 pN 1 W102, a pE194 derivative, in which the thermostable a-amylase 
gene from Bacillus licheniformis is inserted. pNW102 was trans- 
formed into the" Bacillus subtilis strain BF 7658, followed by 
incubation at non-permissive temperature, allowing homologous 
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recombination between the a-amylase genes of pNW102 and BF 7658. 
The a-amylase produced of the recombinant strains shows the same 
characteristics as the a-amylase from Bacillus licheniformis . 

However, in the related art there is no indication of 
5 that DNA sequences can be recombined in vivo by repeating inte- 
gration and cross-out in order to obtain a gene encoding a 
polypeptide with improved characteristics, such as increased 
stability, improved specificity or higher activity. 

Accordingly, a problem of the invention is to provide an 
io improved in vivo process of generating new DNA sequences. 

Rrief Description of the i nvention 

The present invention is directed to a in vivo process 
of generating new DNA sequences. It has turned out that new DNA 
is sequences with improved properties can be generated in a fast 
and efficient way from at least two homologous DNA sequences by 
a process involving in vivo exchange of DNA between at least two 
vectors, each containing a homologous DNA sequence and an origin 
of replication. 

20 T he process of the invention is characterized by, that 

said exchange of said homologous sequences is done by repeating 
in vivo at least once, said integration and excision of one vec- 
tor into the other. 

Accordingly, in a first aspect the invention relates to 

2= a process for in vivo recombination of homologous DNA sequences, 
comprising the following steps 

(a) incubating a cell containing at least two DNA struc- 
tures, each comprising a DNA sequence and an origin of 
replication, under conditions which favour integration 

30 0 f one of the DNA-structures into one of the others, 

thereby forming a hybrid DNA structure; 

(b) incubating the cell under conditions which favour cross- 
ing out from said hybrid DNA structure, thereby forming 
novel DNA structures, each comprising a recombined DNA 

3 5 sequence and an origin of replication; 

(c) repeating steps a)-b) at least once. 
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One of the advantages of the in vivo recombination 
process of the invention is by repeating the exchange in vivo 
between the DNA sequences it is possible to obtain numerous dif- 
5 ferent recombination patterns between said DNA sequences. 

This is illustrated in figure 1 herein, which shows the 
result of in vivo recombination between the two DNA sequences 
Savinase and Savisyn performed as described in example 1. 

In figure 1 Savinase/Savisyn recombined clones such as 
10 clone no. 3, 8 12 are derived from more than one in vivo recom- 
bination event between the two DNA sequences. 

Brief description of the drawing 

Figure 1 illustrates the recombination of two DNA se- 
15 quences, placed on the plasmids pTR, temperature resistant since 
origin of replication is functional at 50°C, and pTS, tempera- 
ture sensitive since origin of replication not functional over 
45°C. Under non-permissive conditions (i.e. at 50°C) , pTS inte- 
grates into pTR, forming a hybrid plasmid. 
20 Figure 2 illustrates the cross out from the hybrid plas- 

mid, forming two new DNA sequences, each placed on a plasmid. 
Repeated crossing in and out may be done by e.g. temperature cy- 
cling or simply by keeping temperature pressure and selecting 
for resistance of pTS (i.e. Erm R ) . 
25 Figure 3 illustrates the plasraids which are obtained af- 

ter repeated integration and crossing out events. 

Further, figures 1 and 2 illustrate a DNA structure sys- 
tem suitable for in vivo measurement of hybrid structure forma- 
tion, and suitable for performing a forced cross out of the DNA 
30 hybrid structure. 

Fig. 1: DNA structure pTS: 

Suitable active promoter directing ex- 
pression of active protein 
Erm R : antibiotic resistance gene 
35 DNA structure pTR comprises: 

Not any active promoter to direct tran- 
scription of GFP 
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GFP: Green Fluorescent Protein 
The Erm R antibiotic resistance gene in pTS is active due 

to the active promoter. 

The GFP gene in pTR is inactive (i.e. the gene is non- 
transcribed) due to there is not an active promoter to drive ex- 
pression of the gene. 

Fig. 2: After formation of the hybrid structure accord- 
ing to the invention GFP is now active (driven by the promoter) . 
This makes it possible to in vivo measure the hybrid structure 
formation (see below for further details) . 

In contrary the Erm R antibiotic resistance gene is now 
inactive (i.e. the gene is non-transcribed) due to there is not 
an active promoter to drive expression of the gene. Forced cross 
out of the DNA hybrid structure is performed by incubating the 
5 cells under Erm R antibiotic pressure, thereby forcing a cross 
out of the DNA hybrid structure, since the cell is only able to 
express the Erm R antibiotic resistance gene after the crossing 
out of the DNA structure has taken place. 

Figures 4 and 5: Illustrate the two vectors pSX120 (fig. 
0 4) and pMB430 (fig. 5) used as described in example 1 to in vivo 
recombine the two DNA sequences Savinase and Savisyn. 

pSX120 comprises i) an open reading frame encoding Sav- 
inase, ii) an active promoter directing transcription of Sav- 
inase, and iii) a gene conferring resistance to chloramphenicol 



5 (Cam*) . 



P MB430 comprises i) a Savizyn DNA fragment, ii) a termi- 
nator (resulting in no transcription through the regions follow- 
ing this) , iii) a gene conferring resistance to erythromycin 
(£rm R ) , and iv) an open reading frame encoding Green Fluorescent 

)0 Protein (GFP) . 

Fig. 6: Illustrates the results of Savinase/Savisyn in 
vivo recombination performed as described in example 1. 

The schematic representation shows 12 of the shuf- 
f led/recombined clones. 
35 " Horizontally are given the positions of the 

restriction sites unique for Savizyn. 

The letter "S" denotes that the sequence at this position 
is identical to the Savinase sequence and the letter "Z" denotes 
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that the sequence stems from Savizyn.. All of the sequenced 
clones are different in their pattern of recombination. 

pef initions 

5 As used herein, the following terms have the following 

meanings: 

The term "DNA sequence" includes any DNA sequence which 
is desirable to modify according to the invention e.g. a DNA se- 
quence encoding, a polypeptide, for example an enzyme, pharma- 

10 ceutically active polypeptides, e.g. insulin growth honaone hu- 
man hormones or growth regulators, and sequences such as promot- 
ers, transcription or translation .regulators and other intracel- 
lular polynucleotides and polypeptides. 

The term "homologous" means the DNA sequences have some 

15 stretches of identical nucleotides, allowing recombination in 
vivo. Said stretches within the DNA sequences are at least 15 
base pair long, more preferably at least 25 base pair long, even 
more preferably at least 50 base long, and most preferably at 
least 150 base pair or more. 

20 The term "homologous 11 comprises stretches of DNA 

sequences that are from 60% identical to DNA sequences that 
differs in only one nucleotide. It is preferred that the 
stretches of DNA sequences are more than 70% identical, more 
preferably at least 80%, especially more than 90% identical, and 

25 more preferred that the stretches of DNA sequences are 95% 
identical, most preferred at least 97%. 

Within the stretches of the DNA sequences of interest, 
the DNA sequence homology referred to above is determined as the 
degree of identity between two sequences indicating a derivation 

30 of the first sequence from the second. The homology may suitably 
be determined by means of computer programs known in the art, 
such as GAP provided in the GCG program package (Program Manual 
for the Wisconsin Package, Version 8, August 1994, Genetics 
Computer Group, 575 Science Drive, Madison, Wisconsin, USA 

35 53711) (Needleman, S.B. and Wunsch, CD., (1970), Journal of Mo- 
lecular Biology, 48, 443-453). Using GAP with the following set- 
tings for DNA sequence comparison: GAP creation penalty of 5.0 
and GAP extension penalty of 0.3, the homologous DNA sequences 
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of interest exhibits a degree of identity preferably as de- 

scribed above. 

The term "Recombined DNA sequence" means the DNA se- 
quence which is a result of a recombination event between at 
5 least two DNA sequences. The recombination event may be a single 
recombination event or may be first a recombination event be- 
tween at least two DNA sequences followed by a cross out at a 
different site than the first recombination event. 

The term "DNA structure" means a DNA molecule, e.g. a 
10 vector, a plasmid, a chromosome (e.g. a Bacillus subtilis chro- 
mosome) , a virus, a phage, a transposon or a genome. 

The term "vector" means any linear or circular DNA mo- 
lecule that is able to contain a DNA sequence to be recombined 
and an origin of replication. The term includes any vector known 
is to the skilled artisan e.g. a plasmid or a phage. The DNA se- 
quence is optionally-but not necessarily- under control of a 
promoter . 

The term "a cell" used herein in connection with a in 
vivo recombination process of the invention is intended to cover 
20 both "a cell" and "a cell population". 

The term "transferring" means that the DNA structure is 
introduced into a cell. The DNA structure may be introduced using 
techniques known in the art for direct introduction of DNA, e.g. 
by use of electroporation, transformation of competent cells, pro- 
25 toplast transformation, transf ection, transduction, conjugation or 
ballistic transformation. 

The term "incubating" (or cultivating, growing) means 
that the cell is treated in such a way that the normal cell 
functions are active, especially the functions necessary for re- 
30 combination. The cell can be incubated on a solid substrate 
(agar petri disc) or the cell can be incubated under submerse 
conditions. It is preferred that the cell is incubated in a test 
tube, e.g. an eppendorph test tube. In the case of a tempera- 
ture-sensitive origin of replication, the test tube is conven- 
35 iently placed in an automatic termocycler. Further the cell may 
be incubated in standard shake-flaks. 

The term "origin of replication" (ori) means a region 
where replication of a DNA structure is initiated. The term 
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functional in connection with origin of replication means that 
replication can be initiated "from the origin. The term non- 
functional means that no replication is initiated under the 
given conditions, or that the initiation of replication is redu- 
s ced, e.g. to a level insufficient for survival of a cell con- 
taining a DNA structure with a marker gene and the ori, under 
appropriate selection pressure. 

The term "temperature sensitive" in connection with ori- 
gin of replication or vector means the vector is unable to rep- 

10 licate at e.g. increased temperatures , i.e. non-permissive con- 
ditions, which yet permit growth of the parent cell. 

The term "temperature resistant" in connection with 
origin of replication or vector means the vector is able to 
replicate at increased temperatures. 

is The* term "DNA library" is a library comprising numerous 

different DNA sequences. A DNA library may be made according to 
standard procedures known in the art, such as e.g. by in vitro 
DNA shuf f ling/DNA recombination (WO 95/17413, WO 95/22625) 
and/or Error-prone PCR and cassette mutagenesis. 

20 A DNA library herein nay comprise as low as 2-20 

different sequences, as e.g. a library where only one amino acid 
is varied. Further a DNA library may comprise numerous different 
DNA sequences, e.g. up to 10 10 different sequences. 

2 5 Detailed Description of the Invention 

The present invention comprises a process for in vivo 
recombination of homologous DNA sequences, comprising the steps 

(a) incubating a cell containing at least two DNA struc- 
tures, each comprising a DNA sequence and an origin of 

30 replication, under conditions where i) at least one ori- 

gin of replication is non-functional and ii) the cell is 
under a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is hav- 
ing the non-functional origin (i.e. favouring an inte- 

35 gration of one of the DNA-structures into one of the 

others, thereby forming a hybrid DNA structure); 

(b) changing, of said conditions to conditions, where fewer 
origins of replication are non-functional than in step 
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a) (i.e. favouring a crossing out from said hybrid DNA 
structure) ; 

(c) repeating steps a)-b) at least once. 

5 The recombined DNA sequence will encode new polypeptides 

having new properties. It is then possible to incorporate a se- 
lection or screening system in the process of the invention in 
order to select and/or screen for desired properties encoded by 
of or more of the recombined DNA sequence (s) . 

io The number of repetition (s) in step (c) above may be one 

repetition or it may be more such as 5-10 repetitions or 50-100 
or even higher numbers of repetitions. 

It is preferred that integration is forced (or made fa- 
vorable for survival of the DNA structure) by incubation of the 

15 cell under such conditions where the origin of replication of a 
DNA structure is non-functional, and that cross out is forced by 
incubation of the cell under such conditions where the origin of 
replication of a DNA structure again is functional. 

In another preferred embodiment the routine depicted 

20 above in the steps (a) to (c) it is preferred that the cells are 
kept under constant conditions {e.g. constant temperature and 
selection pressure) where spontaneous cross-in and cross-out 
happens. That is the shuffling procedure in the cells is re- 
peated without changing the conditions repeatedly. 

25 In a number of DNA structures, such as in many Bacillus 

DNA structures (vectors) (Bacillus subtilis and Other Gram- 
Positive Bacteria, Sonenshein et al., 1993, American Society for 
Microbiology, Washington D.C.) the origin of replication com- 
prise a ori(+) and a ori (-) . ori(+) controls where replication 

30 of the first DNA strand starts, and ori(-) controls the ini- 
tiation point for replication on the second strand (Bacillus 
subtilis and Other Gram-Positive Bacteria, Sonensheim et al., 
1993, American Society for Microbiology, Washington D.C). 

In an embodiment of the process of the invention, at 

35 least one DNA structure does not comprise ori(-) (i.e. only com- 
prise ori(+)). Preferably the DNA structure which does not com- 
prise ori(-) is a DNA structure which origin (i.e. herein 
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ori(+)) is functional during the hybrid DNA structure formation 
performed according to the invention. 

One advantage of this embodiment of the invention is 
when only ori(+) is functional in said DNA structure, this DNA 
5 structure is in a single-stranded state for relatively longer 
period, compared to a DNA structure comprising both ori(+) and 
ori(-). This prolonged single-stranded state facilitate the hy- 
brid DNA structure formation according to a process of the in- 
vention. 

10 Preferably, the DNA structure which does not comprise 

ori(-) as described above is a Bacillus structure, more pre- 
ferred a Bacillus vector. 

In order to illustrate this concept, an example is a 
modified pTR (TR: temperature resistence) Bacillus vector which 

15 does not comprise ori(-) (see figure 1). If the hybrid formation 
is performed at 50°C only the origin of pTR is functional 
(origin of pTS (TS: temperature sensitive) is not functional) 
(see fig. 1/2). The DNA structure pTR only comprise ori(+) and 
is therefore in a single-stranded state for relatively longer 

20 period, which facilitate the recombination event and conse- 
quently hybrid formation with the pTS DNA structure (see fig. 
1,2) . 

It is more preferred that one DNA structure is a vector 
which is able to replicate under certain (permissive) conditions 

25 and unable to replicate under other (non-permissive) conditions. 
The vector may, for instance, be one which is temperature- 
sensitive for replication. Thus, the vector may be one which is 
unable to replicate at increased temperatures, which yet permit 
growth of the parent cell. The cell is initially incubated at a 

30 temperature permitting vector replication and subsequently after 
integration into the other DNA structure may have taken place, 
incubated at a temperature which does not permit vector repli- 
cation so that the vector is lost from the cells unless inte- 
grated. 

35 The vector may further comprise a selectable marker. In 

this case, the incubation at the non-permissive temperature may 
be conducted under selective conditions to ensure that only 
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cells containing the integrated vector (which includes the DNA 
sequence and the selectable marker) , will survive. 

Further, it is preferred to construct a system where it 
is possible in vivo to measure formation of the hybrid struc- 
ture. 

Preferably, this is done by constructing a system 
wherein an in vivo screenable or selectable protein is expressed 
only after the hybrid structure has been formed. 

Preferably, said screenable protein is a protein which 
is fluorescent under suitable exposure, and in particular said 
fluorescent protein is Green Florescent protein (GFP) or vari- 
ants thereof. 

Green Florescent protein (GFP) or variants thereof and how 
the GFP's are activated to fluoresce are described in the art 
5 and reference is made to (Crameri et al. Nature Biotechnology 
14:315-319 (1996); and Cormack et al. GENE 173:33-38 (1996)) for 

further details. 

One of the advantages of using such a screenable protein 
which is fluorescent under suitable exposure of light, and in 
o particular GFP or variants thereof, is that it is then possible 
by e.g. fluorescent-activated cell sorting (FACS) technology to 
selectively sort out. cells which comprise said hybrid structure. 
This may eliminate background of cells, which do not comprise 
said hybrid structure after performing of step a) according to 
5 the first aspect of the invention. 

FACS is a widely known screening technology and reference 
is made to (Cormack et al. "FACS-optimized mutants of the green 
fluorescent protein (GFP) " GENE 173:33-38 (1996)) for further 
details. 

In order to illustrate a suitable system enabling in 
vivo measurement of the formation of the hybrid structure, an 
example of a suitable syster. is shown in figure 1 and 2 . 

Alternatively to the use of a screenable protein as dis- 
cussed above, it may be advantageously to use a selectable 
35 marker, wherein said selectable marker is only transcriptionally 
active when the hybrid structure is formed. 
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Further, it is preferred to that the crossing out from 
the hybrid DNA structure, in "step b) of the first aspect of the 
invention, is a forced crossing out event. 

Forced crossing out from the hybrid structure may be 
5 performed by: 

i) constructing a system wherein the DNA hybrid structure 

does not express a selectable marker, and at least one 
of the at least two DNA structures in step a) of first 
aspect is able to express said selectable marker; 
10 ii) incubating the cell under selectable conditions, 

thereby forcing a cross out of the DNA hybrid struc- 
ture, since the cell is- only able to express the 
marker gene after the crossing out of the DNA hybrid 
structure has taken place, 
is Preferably, selectable marker gene mentioned above is 

expressed on a DNA structure where the origin of replication is 
non-functional during a forced integration of the DNA structures 
into a DNA hybrid structure performed as described above, and 
preferably the selectable marker gene is an antibiotic resis- 
20 tance gene. See below for further description of suitable se- 
lectable marker genes. 

In order to illustrate the concept of forced crossing 
out from the hybrid DNA structure, an example of a suitable sys- 
tem is shown in figure2 1 and 2. 
25 The selectable marker may be any marker known in the art, 

for instance a gene coding for a product which confers anti- 
biotic resistance to the cell, which confers prototrophy to an 
auxotrophic strain, or which complements a defect of the host, 
(e.g. dal genes introduced in a dal- strain; cf. B. Diderichsen 
30 (1986), Bacillus: Molecular Genetics and Biotechnology Applica- 
tions, A.T. Ganesan and J. A. Hoch, Eds., Academic Press, pp. 35- 
46. The selectable marker may, e.g., be excised from a known 
source or present on a vector, e.g. a plasmid, used for the con- 
struction of the DNA structure to be used in the method of the 
35 invention. 

This selection may be accomplished by growing the cells 
under selection pressure for the selection marker encoded by the 
marker gene. 
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The DNA structure (in which the vector is to be 
integrated) may contain a gkne encoding any selectable marker, 
e.g. of any type as described above in connection with the 
marker optionally carried by the vector to be used in the method 
5 of the invention. Thus f the marker gene may encode an antibiotic 
resistance such as resistance to kanamycin, tetracyclin, 
ampicillin, erythromycin, chloramphenicol, or a resistance to 
various heavy metals such as selenate, antimony or arsenate. 

In a preferred embodiment, the cell used in the process 

10 is defective or transitorily inactivated in the enzymatic mis- 
match repair system or has a reduced level of mismatch repair. 
In the synthesis of DNA, errors can occur and the resulting non- 
complementary pairs of bases are called mismatches. A process 
for the correction of errors (mismatches) in DNA exists. In E. 

15 coli the errors are very rapidly and accurately detected by two 
enzymes (MutS and MutL) enabling a third enzyme (MutU) to unwind 
the two DNA strands and a fourth enzyme (MutH) to cut the neo- 
synthesized strand on a sequence of DNA (GATC) itself methylated 
later. For a closer explanation, reference is made to EP 449923 

20 (Setratech), the contents of which hereby are incorporated by 
reference. 

In another preferred embodiment the cell used in the 
process is impaired or hampered in the major recombination en- 
zymes such as AddAB . It has been shown that a strain impaired in 
25 the AddAB genes shows a higher frequency of homologous recombi- 
nation. Both legitimate and illegitimate recombination frequency 
are increased. But especially the recombination frequency for 
illegitimate recombination is favoured when working with a 
strain impaired or hampered in the AddAB ATP-dependent nuclease 
30 (Meima, R ; Haijema, BJ; Dijkstra, H; Haan, GJ; Venema, G; Bron, 
S (1997) Role of enzymes of homologous recombination in ille- 
gitimate plasmid recombination in Bacillus-subtilis . JOURNAL OF 
BACTERIOLOGY Vol- 179 , No. 4 pp. 1219-1229.) 

In another preferred embodiment the cell used in the 
35 process is over-expressing one of the key recombination enzymes, 
recA. The recA enzyme is involved in homologous recombination by 
promoting renaturation of ssDNA to form the heteroduplex molecules 
necessary for the recombination event and initiates the exchange 
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of DNA strands. (PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR 
BIOLOGY Vol. 56 , pp. 129-223 * (1997) ) . The recA over-expression 
can be achieved by either classical mutagenesis of the host strain 
or cloning of a highly expressed recA gene. 
5 In another preferred embodiment the cell used in the 

process is subjected to UV, X-ray or DNA damaging agents to 
increase recombination frequency between the two homologous genes 
to be shuffled. Stress like UV, X-ray or DNA damaging agents is 
known to induce the recombination enzymes via the SOS response and 
10 activate enzymes involved in recombination processes like recA. 
(BACILLUS SUBTILIS AND OTHER GRAM-POSITIVE BACTERIA, pp. 529-537 
(1993) ) . 

In another preferred embodiment the pTR based plasmid used 
as a carrier of one of the genes to be shuffled is replaced by 

15 either a low-copy version of pTR or another B. subtilis compatible 
low-copy plasmid like pWVOl, (MOLECULAR AND GENERAL GENETICS Vol. 
249 , No. 1 pp. 43-50 (1995 )), pTA1060, (JOURNAL OF BACTERIOLOGY 
Vol. 142 , No. 1 pp. 315-8 (1980)) or pAMbl , (JOURNAL OF 
BACTERIOLOGY Vol. 157 , No. 2 pp. 445-53 (1984)). 

20 In a further preferred embodiment the one of DNA construct 

comprising one the genes to be shuffled is situated on the 
chromosome of the cell and another DNA construct, comprising 
another gene of interest to be recombined is a plasmid. When 
performing an in vivo recombination process of the invention the 

25 plasmid will be recombined with the DNA construct on the 

chromosome. Said plasmid may e.g. be a temperature sensitive 
plasmid. 

The advantages of using a low-copy pTR, or a system where 
one of the DNA constructs is situated on the chromosome as 

30 described above, is that the hybrid DNA construct after 

recombination of the two homologous genes will be the predominant 
form of the genes in the cell. A high-copy background of non- 
recombined pTR plasmids will inevitably reverse the gene hybrids 
formed in the first round back to wildtype after the second round 

35 of recombination and therefore prohibit any shuffling of the two 
genes. When using the chromosome as a carrier of one of the genes 
no wildtype background can coexist when the pTS has integrated via 
the homologous genes. 
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It is preferred that the cell used in the process is a cell 
suitable for large scale production of the product of the 
recombined DNA sequence. It is preferred that the cell is a 
bacterial cell, more preferred a Bacillus cell, most preferred a 
5 Bacillus subtilis cell. 

The DNA sequences to be recombined may be isolated/ cloned 
from organisms containing the sequences in a manner known per se. 
For instance, a suitable oligonucleotide probe may be prepared 
on the basis of a first DNA sequence of interest in order to 

10 obtain a second DNA sequence homologous to the first sequence. 
Alternatively, the DNA sequence can be amplified by PCR based on 
primers designed according to knowledge of conserved regions. 

I different embodiments of the present content is used 
the term "a set of homologous DNA sequences ("set A" "set B", 

15 "setC" etc.)", e.g. in relation to an embodiment of the 

invention relating to a process according to the invention, 
wherein the homologous DNA sequences are comprised of two or 
more different set of homologous DNA sequences ("set A" "set B", 
"setC" etc.), and wherein each of said set of homologous 

20 sequences ("set A" "set B", "setC" etc.) are inserted in a 

different DNA construct ("DNA construct A", "DNA construct B", 
"DNA construct C" etc.) according to the invention. 

This term is used to illustrate that an in vivo 
recombination process of the present invention may be used to 

25 recombine numerous "sets" of homologous DNA sequences. Each 

individual "set" of homologous DNA sequence (s) may individually 
comprise from only one to numerous (e.g. a "set" comprising a 
DNA library) individual homologous DNA sequence (s) . 

Each of said "sets" of homologous DNA sequences ("set A" 

30 "set B n , "setC" etc.) may then be inserted in a different DNA 
construct ("DNA construct A", "DNA construct B", "DNA construct 
C" etc.) according to the invention. Each of said different DNA 
construct are preferably different on important parameters, 
according to the invention, such as different origin of 

35 replication, different selectable markers etc. 

All of said DNA constructs ("DNA construct A", "DNA 
construct B", "DNA construct C" etc..) may then be transferred 
into a cell, according to the invention, and when performing a 



WO 98/28416 



15 



PCT7DK97/00567 



process for in vivo recombination of .the invention , then each of 
the individual DNA constructs ("DNA construct A", "DNA construct 
B" , "DNA construct C" etc..) may then recombine randomly 
according to the invention. After a few repetitions of a 
5 process, of the invention, this will give a DNA population, in 
the cell population, comprising numerous different combinations 
of different recombined DNA sequences, recombined randomly 
between all "sets" of homologous DNA sequences used in the 
process . 

10 Further, the term "a set of homologous DNA sequences 

(•'set A" "set B", "setC" etc.)" is used herein to illustrate one 
of the advantages of an in vivo recombination process of the 
invention, which is the flexibility of the system to recombine 
both set of homologous DNA sequences comprising one, few and/or 

is numerous (e.g. big DNA libraries) of individual homologous DNA 
sequences. 

Accordingly, an embodiment of the invention relates to 
process according to the invention, wherein said sets of 
homologous DNA sequences each only contain one DNA sequence 
20 (i.e. recombination of two or more homologous sequences). 

Further in embodiments of the invention the homologous 
DNA sequences to be recombined are derived from a pre-made DNA 
library and/or derived from individually different pre-made 
libraries . 

25 Homologous DNA derived from such DNA libraries are 

inserted into a vector according to the invention (i.e. DNA re- 
presenting the DNA library is ligated (inserted) into a vector). 
As described herein the process of the invention comprise at 
least two DNA structures (e.g. vectors), which are different in 

30 a way which allow forced in vivo recombination of said vectors. 
Homologous DNA representing DNA libraries may be 
inserted into only one and/or both of said different vectors. 

I.e. if for instance homologous DNA representing one DNA 
library is inserted (ligated) into one of said different vectors 

35 and homologous DNA representing another DNA library is inserted 
into the other vector this will allow fast and efficient 
recombination between two different libraries by an in vivo 
recombination process according to the invention. To illustrate 
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this concept, DNA representing one library may be inserted into 
a pTR vector (see figure 1) and DNA representing another library 
may be inserted into a pTS vector (see figure 1). Just a few re- 
peating in vivo recombination events performed by a process ac- 
5 cording to the invention will then give a cell population 
representing a new library which is a combination of the two 
libraries mentioned above. 

Accordingly, embodiments of the invention relates to a 
process according to the invention, wherein at least one of the 

10 sets of homologous DNA sequences is derived from a pre-made DNA 
library, and at least one of said sets of homologous DNA 
sequences is a DNA sequence which, is homologous to said DNA 
library (i.e. resulting in a process for in vivo recombination 
of said DNA library to a DNA sequence which are homologous to 

15 said DNA library) ; or 

a process according to the invention, wherein the sets 
of homologous DNA sequences is derived from the same pre-made 
DNA library (i.e. resulting in a process for in vivo 
recombination of said DNA library) ; or 

20 a process according to the invention, wherein the sets 

of homologous DNA sequences are derived from at least two 
different pre-made DNA libraries {i.e. resulting in a process 
for in vivo recombination of at least two different DNA 
libraries) . 

25 in order to obtain even further recombination between the 

DNA sequences it may be preferred to perform an isolation of the 
DNA structures after a first round of in vivo recombination of 
said DNA sequences performed according to the invention. Said 
isolated structures are then re-transformed into the cells and a 

30 further process round of in vivo recombination is then 
performed. 

The advantages of this step is illustrated by following 
example: 

Set A is a vector A wherein DNA sequences a, b are 
35 introduced individually in said vector A (i.e. the vector A 

population comprises a vector with DNA sequence a, and a vector 
with sequence b) ; and 



WO 98/28416 



17 



PCT/DKS7/00567 



Set B is a vector B wherein DNA sequences c, d are 
introduced individually in said vector B (i.e. the vector B 
population comprises a vector with DNA sequence c, and a vector 
with sequence d) . 
5 Vector A and B are then transformed into a cell 

population and in vivo recombined according to the invention. 
This will give recombination between DNA sequences ac, ad, be, 
bd. The vectors A, and B comprising said recombined sequences is 
isolated, and re-transformed into said cell population. In vivo 

10 recombination according to the invention will then give all 
possible recombination of said four (a,b,c,d) ' DNA sequences, 
such as recombination of abc, abdc, etc. 

In even further embodiments the invention relates to a 
process according to the invention, wherein two sets of homolo- 

15 gous DNA sequences ("set A" and "set B") are recombined in vivo, 
comprising the steps 

a) inserting sequence (s) from "set A" in a vector contain- 

ing a temperature resistant origin of replication and a 
gene for a first marker; 
20 b) inserting sequence (s) from "set B" in a vector contain- 

ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the vectors into a cell; 

d) Incubating the cell at a temperature where both origins 
25 of replication are functional, optionally under a selec- 
tion pressure that favours cells with both marker genes; 

e) shifting the temperature to a temperature, where only 
the temperature resistant origin of replication is func- 
tional ; 

30 f) repeating steps d)-e) at least once; 

or 

a process according to the invention, wherein two sets 
35 of homologous DNA sequences ("set A" and "set B") are recombined 
in vivo, comprising the steps 
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a) inserting sequence (s) from "set A" in a vector contain- 
ing a temperature resistant origin of replication and a 
gene for a first marker; 

b) inserting sequence (s) from "set B" in. a vector contain- 
ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the vectors into a cell; 

d) Incubating the cell at a temperature where only the tem- 
perature resistant origin of replication is functional, 
optionally under a selection pressure that favours cells 
with both marker genes; 

e) optionally, shifting the temperature to a temperature, 
where both origins of replication are functional; 

f) repeating steps d)-e) at least once. 

In a further aspect the invention relates to a method for 
producing one or more recombinant protein (s) having a desired 
biological activity comprising: 

a) performing a process for in vivo recombination of homo- 
logous DNA sequences (the process is performed according 
to the invention as described herein) , wherein at least 
one of the DNA structure (s) is an expression vector; 

b) expressing the numerous different recombinant polypeptides 
encoded by the numerous different recombined DNA sequences 
from step a) ; and 

c) screen or select the numerous different recombinant 
proteins from step b) in a suitable screening or selection 
system for one or more recombinant polypeptides (s) having 
a desired activity. 



Preferably, the expression vector in step a) above is a DNA 
structure which is unable to replicate (e.g. having a non- 
functional origin of replication) during a forced hybrid DNA 
structure formation (i.e. integration of said structures into 
each other) performed according to a process for in vivo 
recombination of homologous DNA sequences according to the 
invention. This will improve the probability that the expression 
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vector, after a crossing-out event of said hybrid-structure 
according to the invention, comprise recombined homologous DNA, 
since the only way said expression vector may have survived the 
process of forced hybrid formation is by actually recombine with 
5 another DNA structure, which is able to replicate under the 
conditions used to force said hybrid-structure formation. 

After the process for in vivo recombination have been 
performed (step a) above) , and before the expression and 
screening steps b) and c) above, it may be advantageously to 

10 make a cell population mainly comprising only the expression 
vector (i.e. other DNA construct (s) are partly or nearly 
completely removed from the cell -population) . This is particular 
advantageously when the expression vector is unable to replicate 
during the forced hybrid DNA structure formation as described 

15 immediately above, since such a cell population will have a very 
limited background of non-reconbined DNA sequences of interest, 
which is advantageously in the subsequent screening step (step 
c) above) . 

Making a cell population mainly comprising only the 
20 expression vector may be performed according to standard pro- 
cedures known in the art, such as growing the cell population at 
conditions selectively favouring replication of the expression 
vector, as compared to the other DNA structures; or selectively 
purify said expression vector form the cells and subsequently 
25 transfer this into a new cell population. 

Expression the recombinant polypeptides encoded by the 
recombined sequence in step a) above may be performed by use of 
standard expression vectors, and/or a standard expression vector 
may be modified in order to comprise elements specially suitable 
30 for a in vivo recombination process according to the invention. 

A suitable screening or selection system, to screen or se- 
lect the numerous different recombinant proteins from step b) 
above, will depend on the desired biological activity. 

A number of suitable screening or selection systems to 
35 screen or select for a desired biological activity are described 
in the art. Examples are: 
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Strauberg et al. (Biotechnology 13: 669-673 (1995), which 
describe a screening system to screen for Subtilisin variants hav- 
ing a calcium-independent stability; 

Bryan et al. (Proteins 1:326-334 (1986)), which describe a 
s screening assay to screen for Protease having a enhanced thermal 
stability; and 

PCT-DK96/00322 which describe a screening assay to screen 
for lipases which are having an improved wash performance in wash- 
ing detergents. 

10 A preferred embodiment of the invention comprise screening 

or selection of recombinant protein (s), wherein the desired bio- 
logical activity is improved performance in dish-washing or laun- 
dry detergents. Examples of suitable dish-wash or laundry deter- 
gent are disclosed in PCT/DK96/00322 and WO 95/30011. 

15 The invention also comprises a set (at least two) of 

vectors, comprising at least two different origins of replica- 
tion able to function in the same cell under permissive con- 
ditions, and at least two homologous DNA sequences encoding 
polypeptides. In a preferred embodiment, at least one origin of 

20 replication is temperature sensitive. The vector may preferable 
be a plasmid. 

The invention "further comprises a vector systen, con- 
prising at least 

a) a vector A containing a DNA sequence encoding a polypep- 
25 tide and an origin of replication, and 

b) a vector B containing a DNA sequence homologous to the 
DNA sequence in vector A, and an origin of replication 
that differs from the origin of replication in vector A. 
The vectors may preferable be plasmids. 

30 

It is preferred that the DNA sequence encodes polypep- 
tides with a biological activity, especially enzymes and in par- 
ticular enzymes such as an amylase, lipase, cutinase, cellulase, 
oxidase, phytase, and a protease. 
35 The invention further comprises a new, recombined DNA 

sequence, characterised in that has be produced by a process ac- 
cording to the invention, or it is encoding a recombinant pro- 
tein having a desired biological activity, wherein said recombi- 
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nant protein having a desired biological is produced by the 
method according to method for producing one or more recombinant 
protein (s) having a desired biological activity according to the 
invention. 

5 The invention further comprises a process for producing 

a polypeptide, especially an enzyme, comprising i) growing a 
cell expressing a polypeptide encoded by a recombined DNA se- 
quence according to claim 38, and ii) purifying said expressed 
polypeptide. 

10 The invention further comprises a new polypeptide, espe- 

cially an enzyme, characterised in that it is produced by a 
process according to the invention. 

The invention further comprises a polypeptide, espe- 
cially an enzyme, characterised in that it is encoded of a re- 
is combined DNA sequence according to the invention. 

Polypeptides of the present invention include peptides and 
proteins, and optionally glycosylated forms thereof. Examples are 
enzymes, pharmaceutically active polypeptides such as analogues 
of human or mammal polypeptides, e.g. insulin, growth hormone, 
20 human hormones or growth regulators. 

The invention is described in further detail in the 
following examples which are not in any way intended to limit the 
scope of the invention. 

2 5 EXAMPLES 

MATERIALS AND METHODS 

Strains. 

30 E.coli: SJ2 (Diderichsen, B., Wedsted, U. , Hedegaard, L., 

Jensen, B. R. , Sjoholm, C. (1990) Cloning of aldB, which encodes 
alpha-acetolactate decarboxylase, an exoenzyme from Bacillus 
brevis. J. Bacteriol., 172, 4315-4321) Electrocompetent cells 
prepared and transformed using a Bio-Rad GenePulser™ as 

35 recommended by the manufacturer. 

B.subtilis DN188.5 Diderichsen, B., Wedsted, U, , Hedegaard, L. , 
Jensen, B. R. , Sjoholm, C. (1990) Cloning of aldB, which encodes 
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alpha-acetolactate decarboxylase, an exoenzyme from Bacillus 
brevis. J. Bacterid., 172, 4315-4321). Competent cells were pre- 
pared and transformed as described by Yasbin, R.E., Wilson, G.A. 
and Young, F.E. (1975) Transformation and transfection in ly- 
5 sogenic strains of Bacillus subtilis : evidence for selective in- 
duction of prophage in competent cells. J. Bacterid, 121:296-304. 

B.suJbtilis PL1801. This strain is the B. subtilis DN1885 with 
disrupted apr and npr genes (Diderichsen, B., Wedsted, U. , 
10 Hedegaard, L. , Jensen, B. R. , Sjeholm, C. (1990) Cloning of 
aldB, which encodes alpha-acetolactate decarboxylase, an 
exoenzyme from Bacillus brevis. J, "Bacterid., 172, 4315-4321). 

Plasmids: 

is P SX120: B . subtilis expression vector (Described in EP 405901, 
and WO 91/00345) . This vector comprise the wild-type Savinase 
gene. 

pE194: (Horinouchi, S and Weisblun, B., 1982, J. Bacteriol. 
150:804-814) . 

20 P SX222: B . subtilis expression vector (Described in WO 96/34946, 
and W0 92/19729). This vector comprise the synthetic Savinase 
gene herein called Savisyn. In the Savisyn gene is introduced a 
large number of restriction sites as compared to the wild-type 
Savinase gene comprised in pSX120. 

2 5 

P UC19: Yanisch-Perron, C. , Vieria, J. and Messing, J. (1985) 
Gene 33, 103-119. 

pF64L-S65T-GFP: is the E.coli plasmid encoding the DNA fragment 
30 encoding the F64L-S65T-GFP described in WO 97/11094. 

pHUTIN-4-MCS: Plasmid can be obtained from Laboratoire de Geneti- 
que Microbienne, Institut National de la Recherche Agronomique, 
78352. Jouy en Josas - CEDEX, France. 



35 



Solutions/Media 
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TY and LB agar (as described in Ausubel, F. M. et al. 
(eds.) "Current protocols in Molecular Biology". John Wiley and 
Sons, 1995), 

LBPG is LB agar supplemented with 0.5% Glucose and 0.05 M 
5 potassium phosphate, pH 7.0 

LBPGSK plates are LBPG media with agar and 1% of skimmed 

milk. 

Antibiotics are used in different medias in concentrations 
of 6 jig/ml Chloramphenicol, 10 ^g/ml Kanamycine and 5 ug/ml og 

10 Erythromycine. 

TE-buffer (as described in Ausubel, F. M. et al. (eds.) 
"Current protocols in Molecular Biology". John Wiley and Sons, 
1995) . 

15 General molecular biology methods 

DNA manipulations and transformations were performed using 
standard methods of molecular biology (Sambrook et al. (1989) 
Molecular cloning: A laboratory nanual, Cold Spring Harbor lab., 
Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current 

20 protocols in Molecular Biology". John Wiley and Sons, 1995; 

Harwood, C. R. , and Cutting, S. M. (eds.) "Molecular Biological 
Methods for Bacillus 1 '. John Wiley and Sons, 1990). 

Enzymes for DNA manipulations were used according to the 
specifications of the suppliers. 

25 

Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA 
manipulations, such as e.g. restriction endonucleases , ligases, 
polymerases, PCR polymerases etc., are obtained from New England 
30 Biolabs, Inc. 

Construction of the temperature sensitive shuffling pl asmid 

pMB4 30 was constructed in a five step cloning process as 
depitched in the following, the entire DNA sequence is found in 
35 SEQ ID No. 1. 
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The temperature sensitive plasmid pE194 was modified by intro- 
ducing multiple cloning sites around the unique Clal site of the 
pE194 plasmid and thus destroying the Clal site. This introduc- 
tion was done by PCR using the primers #22899 and #24039 to am- 
5 plify the whole plasmid and introducing several unique cloning 
sites. 

The PCR was performed as follows: 

1 /xl of pE194 Qiaquick plasmid prep was used for 50 pi 
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM 
o KCl, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 /*M of 
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer , 
Cetus, USA) and 100 pmol of each -primer : 
#22899 

5 • -GCAGCTGGATCCGCGGCCGCGAATTCGTCTTTTGCGCAGTCGGC-3 ' 



5 



#24039 

5 ' -GCAGCTGGATCCGGGCCCGGGAAGCTTCGATTCACAAAAAATAGGCACACG-3 » 

The PCR reactions were performed using a DHA thermal 
o cycler (Landgraf, Germany). One incubation at 94°C for 1 min 
followed by forty cycles of PCR performed using a cycle profile 
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min, 
and extension at 72°C for 2 min. Five-^1 aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
25 agarose gels (NuSieve, FMC) . 

subclonina of PCR fragment 

Forty-f ive-Ail aliquots of the PCR products generated as 
described above were purified using QIAquick PCR purification 
30 kit (Qiagen, USA) according to the manufacturer's instructions. 
The purified DNA was eluted in 50 pi of lOmM Tris-HCl, pH 8.5. 
Fifty -Ml of the purified PCR fragment was digested with BamHI, 
electrophoresed in 0.8 % low gelling temperature agarose 
(SeaPlaque GTG, FMC) gels, the relevant fragment was excised 
35 from the gels, and purified using QIAquick Gel extraction Kit 
(Qiagen, USA) according to the manufacturer's instructions. The 
isolated DNA fragment was then ligated to BamHI digested pUC19 
and the ligation mixture was used to transform E. coli SJ2 . 
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Cells were plated on LB agar plates containing Erythro- 
mycin (200 Mg/ral) supplemente'd with X-gal (SIGMA, USA) (5-Bromo- 
4-chloro-3-indolyl alpha-D-galactopyranoside, 50 /xg/ml) . 

5 identification and characterisation of positive clones 

The transformed cells were plated on LB agar plates 
containing Erythromycin (200 /xg/ml) supplemented with X-gal (50 
/ig/ml) and incubated at 37°C over night. Next day white colonies 
were rescued by restreaking these onto fresh LB- Erythromycin 

10 agar plates and incubated at 37°C over night* The next day 
single colonies of each clone were transferred to liquid LB 
medium containing Erythromycin (200* tig/ml) and incubated 
overnight at 37°C with shaking at 250 rpm. 

Plasmids were extracted from the liquid cultures using 

15 QIAgen Plasmid Purification mini kit (Qiagen # USA) according to 
the manufacturer's instructions. Five-jxl samples of the plasmids 
were digested with BamHI . The digestion were checked by 
gelelectrophoresis on a 0.7 % agarose gel (NuSieve, FMC) . The 
appearance of two DNA fragments of about 2.7 kb and 3.7 

20 indicated a positive clone. The clone was designated MB293 and 
the plasmid pMB293. 

2) 

PMB293 was then EcoRI digested and the 3.8 kb fragment was 
25 gelpurified on a 0.7 h agarose gel and using QiaQuick gelpurifi- 
cation kit from Qiagen according to the manufacturer. This frag- 
ment which constitutes the pE194 with inserted multiple cloning- 
sites (originating from the PCR primers and from MCS of pUC19) 
was circulised by ligation and used to transform B.subtilis 
30 DN1885, transformed cells were selected on LBPG plates contain- 
ing 5 M<?/roi Erythromycin and incubation was done at 33 °C over- 
night. Clones from this transformation were restreaked and cul- 
tivated overnight in TY with 5 Mg/ml Erythromycin. From over- 
night- culture broth cells were isolated and plasmids were puri- 
35 fied using Qiaquick plasmid kit. Restriction enzyme digestion of 
the plasmids verified the correctness of the clones. One such 
clone and plasmid were termed MB333 an pMB333. 
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The pMB333 was then used as the temperature sensitive plasmid 
for the construction of a plasmid containing the the Savizyn 
gene without any promoter. This construct was made as follows. 

PGR using the primers #21547 and #21548 and the pSX222 as 
the template was performed according to these conditions: 

1 nl of P SX222 Qiaquick plasmid prep was used for 50 jil 
PGR amplification in PGR buffer (10 mM Tris-HCl, pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 M M of 
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer , 
Cetus, USA) and 100 pmol of each primer: 
#21547 

S 5 ' -GCAGCTGCAGGATCCGAATTCGCGGCCGCGCAATCGGTACCATGG-3 ' 
#21548 

5 • -GCAGCGAGCTCAAGCTTCCCGGGCCCAGCCGGTCGACCGCGTTGCCGCTTCTGCG-3 ' 

The PCR reactions was performed using a DNA thermal 
o cycler (Landgraf, Germany). One incubation at 94°C for 1 min 
followed by forty cycles of PCR performed using a cycle profile 
of denaturation at 94°C for 1 min, annealing at 55°C for 1 mm, 
and extension at 72 °C for 2 min. Five-^l aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
25 agarose gels (NuSieve, FMC) . 

cnhrlnnina of fragment 

Fortyfive-Ml aliquots of the PCR products generated as 
described above were purified using QIAquick PCR purification 
30 kit (Qiagen, USA) according to the manufacturer's instructions. 
The purified DNA was eluted in 50 M l of lOmM Tris-HCl, pH 8.5. 
Fifty -Ml of the purified PCR fragment was digested with BamHI 
and Apal, electrophoresed in 0.8 % low gelling temperature 
agarose (SeaPlaque GTG, FMC) gels, the relevant fragment was 
35 excised from the gels, and purified using QIAquick Gel 

extraction Kit (Qiagen, USA) according to the manufacturer's 
instructions. Purified fragment was ligated to gelpurified BamHI 
and Apal digested pMB33 3. The ligation was used to transform 
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B.subtilis DN1885. Transformed cells .were plated on LBPG agar 
plates containing 5 Aig/ml Erythromycin and incubated overnight 
at 33 °C. Clones were restreaked on similar plates and plasmids 
were isolated from overnight cultures of clones incubated at 
5 33 °C in TY with 5 ng/ml Erytrhomycin plasmids were isolated 
using Qiaguick plasmid kit as described by manufacturer. 
Plasmids were checked by enzymatic digestion using Apal and 
BamHI. One correct clone was termed MB339 and the plasmid 
pMB3 39. 

10 

4) 

We wanted to test an easy way of .monitoring that hybrid plasmids 
had formed. Thus we wanted to monitor the forced recombination 
between the two homologous genes on the two seperate plasmids. 

15 This was done by constructing a pE194 derivative having a pro- 
moter-lacking Savizyn transcriptionally fused to an ORF encod- 
ing GFP. In this way there would be no expression of GFP (due to 
lack of promoter upstream of the Savizyn and thus the GFP gene) 
unless the two homologous genes had recombined and thus getting 

20 the GFP under control of the promoter otherwise present on 
pSX120. The plasmid was constructed by PGR amplifying the GFP 
gene and introducing it just downstream of the savizyn gene on 
the pMB33 9. 

The GFP encoding DNA used herein is a derivative of 
25 the original wildtype gene, the derivative of GFP was cloned 
from the DNA construction of the mutant F64L-S65T-GFP which was 
constructed as described in international patent application WO 
97/11094. 

The construction of the template plasmid was de- 
30 scribed in WO 97/11094. The DNA fragment encoding the F64L-S65T- 
GFP was PCR amplifyed from the E.coli plasmid encoding this gene 
as follows: 

1 pi of pF64L-S65T-Gr ? Qiaquick plasmid prep was used 
for -50 pi PCR amplification in PCR buffer (10 mM Tris-HCl, pH 
35 8.3, 50 mM KC1, 1-5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 
200 pM of each dNTP, 2.5 units of AmpliTaq polymerase (Per- 
kin-Elmer, Cetus, USA) and 100 pmol of each primer: 
#20231 
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5 • -AACTGCAGAAGATGTGGACGCGC-3 ' 
#101381 

5 • -GCCCAATGCATAAACTGCATCCCTTAACTTGTTTTATTTGTATAGTTCATCCATGCCATG- 



3 

5 



The PCR reactions was performed using a DNA thermal 
cycler (Landgraf, Germany). One incubation at 94°C for 1 min 
followed by forty cycles of PCR performed using a cycle profile 
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min, 
10 and extension at 72 °C for- 2 min. Five-^1 aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % 
agarose gels (NuSieve, FMC) . 

gnhr-lnnina of fragment 
15 Forty- five-/zl aliquots of the PCR products generated as 

described above were purified using QIAquick PCR purification 

kit (Qiagen, USA) according to the manufacturer's instructions. 

The purified DNA was eluted in 50 fil of lOmM Tris-HCl, pH 8.5. 

The PCR fragment and pMB339 were digested with Clal and Nsil, 
20 gelpurified as above and ligated together. The ligation mixture 

was used to transform DN1885, transformed cells were plated on 

LBPG 5 Mg/ml Erythromycin. After 18 hours of incubation green 
flourescent cells were visible as single colonies. One of these 
were analysed and saved as MB406 and the corresponding plasmid as 
2 5 pl-3406. Since the GFP encoding ORF did not have an upstream pro- 
moter driving the transcription a possible explanation for the GFP 
expression could be a read through from a promoter further up- 
straera of the Savizyn-GFP transcriptional fusion, namely the pro- 
moter of the repF gene. 

30 

5) 

in order two abolish any read-through into the ORF of GFP, a tran- 
scriptional terminator was inserted between the repF gene and the 
Savizyn DNA. The terminator was amplified from the pMUTIN-4-MCS 
35 plasmid using the following primers and conditions: 

1 ,xl of pMUTIN-4-MCS Qiaquick plasmid prep was used for 50 ^1 
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01 % (w/v) gelatin) containing 200 /xM of 
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each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer , 

Cetus, USA) and 100 pmol of each primer: 

#102588 

5 • -CGGACGGTACCGGTAATGACTCTCTAGCTTGAGGC-3 • 
5 #102589 

5 ■ -GGGTACAGATCTCCGCGGCGCAAAAAGGCCATCCGTCAGG-3 1 

The PCR reactions was performed using a DNA thermal cycler 
(Landgraf, Germany). One incubation at 94°C for 60 sec followed 
by forty cycles of PCR performed using a cycle profile of dena- 

10 turation at 94°C for 30 sec, annealing at 55°C for 30 min, and 
extension at 72 °C for 1 min. Five-/il aliquots of the ampli- 
fication product was analyzed by electrophoresis in 0.7 % aga- 
rose gels (NuSieve, FMC) . 

The amplified DNA was purified as above and cloned as a Kpnl- 

15 Bglll fragment in pMB406 also Bglll and Kpnl digested (same proce- 
dure for gelpurifying etc, as decribed above) . The ligation mix- 
ture was used to transform DN188 5, transformed cells were plated 
on LBPG 5 ng/ml Erythromycin. After 18 hours of incubation cells 
were visible as single colonies. One of these were ananlysed and 

20 saved as MB430 and correspondingly the plasmid pMB430. This clone 
had absolutely no expression of GFP indicating that the cloned 
terminator in fact worked as anticipated. 

The resulting sequence of the entire plasmid pMB430 is listed as 
SEQ NO 1. 

25 

Establishing the NB432 strain 

PL1801 Bacillus subtilis was transformed with the two plasmids 
pMB430 and pSX120 using approximately 1 /ig of each plasmid per 100 
^1 of competent PL1801. The transformed cells were plated onto 

30 LBPG-5 /ig/ml Erythromycin, incubated overnight at 33 °C. Next day 
colonies were restreaked onto LBPG-5 pg/ml Erythromycin + 6 jig/ml 
Chloramphenicol and incubated overnight at 33 °C. Clones were also 
grown i liquid culture TY- 5 /ig/nl Erythromycin 4- 6 fig/ml Chloram- 
phenicol overnight at 3 3 C C, plasnids were purified from culture 

35 broth and the existence of both pMB4 30 and pSX120 in the PL1801 
strain was verified. This clone was saved and termed MB432. 
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EXAMPLE 1 

Sequence shuffling of two homologous sequences both coding for 
the subtilisin gene of Savinase 
5 The Savinase wt gene (comprised in pSX120) coding for a 

protease of the subtilisin type was chosen as the target for 
shuffling of highly homologous genes. A site-directed variant of 
the Savinase gene (hereafter called Savizyn, comprised in 
P SX222, where a large number of silent mutations giving new re- 

10 striction sites are introduced, was used as the shuffling part- 
ner to Savinase. The two genes were cloned on two separate com- 
patible plasmids according to the invention. A transcriptionally 
active Savinase gene was cloned on a plasmid based on the pUBHO 
origin coding for resistance to chloramphenicol (pSX120, fig. 

15 4). A promoter less Savizyn gene was cloned on a derivative of 
the temperature sensitive plasmid P E194 which also codes for re- 
sistance to erythromycin (pMB430) . Two transcription terminators 
were cloned upstream of the Savizyn gene to prevent any read- 
through. 

20 In pMB430 the green florescence protein (GFP) was tran- 

scriptionally fused to the Savizyn gene in order to be able to 
visually follow the in-cross and out-cross events between the 
two plasmids. 

B.subtilis strain M3432, pL1801 transformed with the two 
25 plasmids P SX120 and pM3430, were taken through two cycles of 
temperature shifts to induce two consecutive double-cross over 
events between the homologous genes. Homologous recombination 
and plasmid hybrid formation was selected on LBPG plates at the 
restrictive temperature (47-50°C) in the presence of erythromy- 
30 cin. Under these restrictive conditions, the temperature sensi- 
tive plasmid pMB430 is forced into the stable pSX120 plasmid by 
homologous recombination in the region of Savinase/Savizyn. 
These colonies where recombination has occurred also becomes 
fluorescent as a result of the recombination between the two 
35 plasmids and concomitant activation of the Savizyn-GFP operon 
(fig. 2). Out-cross events were screened by growing the clones 
in LB-broth at the permissive temperature of 30°C for two days 
and plating the cells on LBPG plate with chloramphenicol and 
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erythromycin. The non-fluorescent colonies were restreaked on 
plates and taken through a se'cond round of temperature cycle as 
described above. 

After the second round of shuffling, plasmid preparations 
5 were made from 20 different clones. Competent PL1801 was trans- 
formed with each of the 2 0 plasmid preps and plated on LBPG sup- 
plemented with erythromycin. The plates were replicated onto 
LBPG plates with chloramphenicol and erythromycin to identify 
unwanted clones where both pSX120 and pMB430 had been trans- 

10 formed. Clones where only the pMB43 0 plasmid had been trans- 
formed were selected for sequencing of the Savizyn region to 
identify if shuffling had taken place. It is evident from figure 
6 that recombination between the two genes Savinase and Savizyn 
had taken place. All of the 12 sequenced clones show some degree 

15 of recombination and 4 of these can positively be identified • as 
being result of two consecutive double recombination events. It 
is important to note that exchange of Savinase and Savizyn se- 
quence can only be observed if the shuffled fragment overlaps a 
restriction site that is unique for Savizyn. The four clones 

20 with a positive evidence for two double recombinations is there- 
fore the minimum number since the recombination could have oc- 
curred between two neighbouring restriction sites. 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pMB430 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GATCAGTACA AGAAAGATAC TGTATTTCAT AAACAGGAAC TGCAAGAAGT TAAGGATGAG 60 

TTACAGAAGG CAAATAAGCA GTTACAGAGT GGAATAGAGC ATATGAGGTC TACGAAACCC 120 

TTTGATTATG AAAATGAGCG TACAGGTTTG TTCTCTGGAC GTGAAGAGAC TGGTAGAAAC 180 

ATATTAACTG CTGATGAATT TGAACGCCTG CAAGAAACAA TCTCTTCTGC AGAACGGATT 240 

GTTGATGATT ACGAAAATAT TAAGAGCACA GACTATTACA CAGAAAATCA AGAATTAAAA 300 

AAACGTAGAG AGAGTTTGAA AGAAGTAGTG AATACATGGA AAGAGGGGTA TCACGAAAAA 360 

AGTAAAGAGG TTAATAAATT AAAGCGAGAG AATGATAGTT TGAATGAGCA GTTGAATGTA 4 20 

TCAGAGAAAT TTCAAG CTAG TACAGTGACT TTATATCGTG CTGCGAGGGC GAATTTCCCT 4 80 

GGGTTTGAGA AAGGGTTTAA TAGGCTTAAA GAGAAATTCT TTAATGATTC CAAATTTGAG 540 

CGTGTGGGAC AGTTTATGGA TGTTGTACAG GATAATGTCC AGAAGGTCGA TAGAAAGCGT 600 

GAGAAACAGC GTACAGACGA TTTAGAGATG TAGAGGTACT TTTATGCCGA GAAAACTTTT 660 
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TGCGTGTGAC AGTCCTTAAA ATATAC7TAG AGCGTAAGCG AAAGTAGTAG CGACAGCTAT 720 

TAACTTTCGG TTGCAAAGCT CTAGGATTTT TAATGGACGC AGCGCATCAC ACGCAAAAAG 780 

GAAATTGGAA TAAATGCGAA ATTTGAGATG TTAATTAAAG ACCTTTTTGA GGTCTTTTTT 840 

TCTTAGATTT TTGGGGTTAT TTAGGGGAGA AAACATAGGG GGGTACTACG ACCTCCCCCC 900 

TAGGTGTCCA TTGTCCATTG TCCAAACAAA TAAATAAATA TTGGGTTTTT AATGTTAAAA 960 

GGTTGTTTTT TATGTTAAAG TGAAAAAAAC AGA7G7TGGG AGGTACAGTG ATGGTTGTAG 1020 

ATAGAAAAGA AGAGAAAAAA GTTGCTGTTA CTTTAAGACT TACAACAGAA GAAAATGAGA 1080 

TATTAAATAG AATCAAAGAA AAATATAATA TTACCAAATC AGATGCAACC GGTATTCTAA 1140 

TAAAAAAATA TGCAAAGGAG GAATACGGTG CATTTTAAAC AAAAAAAGAT AGACAGCACT 1200 

GGCATGCTGC CTATCTATGA CTAAATTTTG TTAAGTGTAT TAGCACCGTT ATTATATCAT 12 60 

GAGCGAAAAT GTAATAAAAG AAACTGAAAA CAACAAAAA7 TCAAGACGAC GTAATTGGAC 1320 

ATTTGTTTTA TATCCAGAAT CAGCAAAAGC CGAGTCGTTA GAGTATTTAA AAGAGTTACA 1380 

CATTCAATTT GTAGTGTCTC CATTACATGA TAGGGATACT GATACAGAAG GTAGGATGAA 1440 

AAAAGAGCAT TATCATATTC TAGTGATGTA TGAGGGTAAT AAATCTTATG AACAGATAAA 1500 

AATAATTAAC AGAAGAATTG AATGCGACTA 7TCCGCAGAT TGCAGGAAGT GTGAAAGGTC 1560 

TTGTGAGATA TATGCTTCAC ATGGACGATC CTAATAAATT TAAATATCAA AAAGAAGATA 1620 

TGATAGTTTA TGGCGGTGTA GATGTTGATG AA77A7TAAA GAAAACAACA ACAGA7AGA7 1680 

ATAAA77AA7 7AAAGAAA7G A77GAG777A 77GA7GAACA AGGAATCG7A GAA7T7AAGA 1740 

GT7TAA7GGA 77ATGCAA7G AAG777AAAT 77GA7GA77G G77CCCGC77 T7A7G7GA7A 1800 
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ACTCGGCGTA TGTTATTCAA GAATATATAA AATCAAATCG GTATAAATCT GACCGATAGA 1860 

TTTTGAATTT AGGTGTCACA AGACACTCTT TTTTCGCACC AGCGAAAACT GGTTTAAGCC 1920 

GACTGCGCAA AAGACGAATT CGAGCTCGGT ACCGGTAATG ACTCTCTAGC TTGAGGCATC 1980 

AAATAAAACG AAAGGCTCAG TCGAAAGACT GGGCCTTTCG TTTTATCTGT TGTTTGTCGG 2040 

TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGCTCTAGCT AAGCAGAAGG CCATCCTGAC 2100 

GGATGGCCTT TTTGCGCCGC GGAGATCTAA ATATTCGTGG TGCCCCAAGC TTTGTACCAG 2160 

GGGAACCGTC GACTCAAGAT GGGAATGGGC ATGGCACGCA TGTGGCCGGG ACGATCGCTG 2220 

CTTTAAACAA TTCGATTGGC GTTCTTGGCG TAGCGCCGAG CGCTGAGCTA TACGCTGTTA 2280 
AAGTCCTAGG GGCGAGCGGT TCAGGTTCGG TCACCTCGAT TGCCCAAGGA TTGGAATGGG 2340 
CAGGGAACAA TGGCATGCAC GTTGCTAATT TGAGTTTAGG AAGCCCTTCG CCAAGTGCCA 2400 
CACTCGAGCA AGCTGTTAAT AGCGCGACTT CTAGAGGCGT TCTTGTTGTA GCGGCATCTG 2460 
CGAATTCAGG TGCAGGCTCA ATCAGCTATC CGGCGCGCTA TGCGAACGCA ATGGCAGTCG 2520 
GAGCTACTGA TCAAAACAAC AACCGCGCTA GCTTTTCACA GTATGGCGCA GGCCTTGACA 2580 
TTGTCGCACC CGGGGTAAAC GTGCAGAGCA CATACCCAGG TTCAACATAT GCCAGCTTAA 2640 
ACGGTACATC GATGGCTACT CCTCATGTTG CAGGTGCGGC CGCCCTTGTT AAACAAAAGA 2700 
ACCCATCTTG GTCTAATGTA CAAATTCGAA ATCATCTAAA GAATACGGCA ACTAGTTTAG 2760 
GAAGCACGAA CTTGTATGGA AGCGGACTTG TTAACGCAGA AGCGGCAACG CGTTAAAAAT 2820 
GAGGAGGGAA GCTTTATGAG TAAAGGAGAA GAACTTTTCA CTGGAGTTGT CCCAATTCTT 2880 
GTTGAATTAG ATGGCGATGT TAATGGGCAA AAATTCTCTG TTAGTGGAGA GGGTGAAGGT 2940 
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GATGCAACAT ACGGAAAACT TACCCTTAAA 
CCATGGCCAA CGCTTGTCAC TACTCTCTCT 
GATCATATGA AACAGCATGA CTTTTTCAAG 
AGAACTATAT TTTACAAAGA TGACGGGAAC 
GGTGATACCC TTGTTAATAG AATCGAGTTA 
ATTCTTGGAC ACAAAATG GA ATACAATTAT 
AAACCAAAGA ATGGCATCAA AGTTAACTTC 
GTTCAATTAG CAGACCATTA TCAACAAAAT 
CCAGACAACC ATTACCTGTC CACGCAATCT 
GATCACATGA TCCTTCTTCA CTTTGTAACA 
CTATACAAAT AAAACAAGTT AAGCCATCCA 
TAATTTATAG CTATTGAAAA GACATAACAA 
AAT7CCTGCA TGTTTTAAGG AATTGTTAAA 
TTTGTTAACC CATTTCATAA CGAAATAATT 
TGATTTTTTT CTACTTAATC TGATAAGTGA 
TTCTCTTGGA ACCATACTTA ATATAGAAAT 
TGAGCGTTTT GTATTTAATA ATCTTTTAGC 
ATTAGCTATA CTATCAAAAA CAATTTTGCG 
ATTACCATAT ATTTTATAGG ATTGGTTTTT 
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TTTATTTGCA CTACTGGGAA GCTACCTGTT 3000 

TATGGTGTTC AATGCTTTTC TAG ATA C CCA 3060 

AGTGCCATGC CCGAAGGTTA TCTACAGGAA 3120 

TACAAGACAC GTGCTGAAGT CAAGTTTCAA 3180 

AAAGGTATTG ATTTTAAAGA AGATGGAAAC 3240 

AACTCACATA ATG TATA CAT CATGGCAGAC 3300 

AAAATTAGAC ACAACATTAA AGATGGAAGC 3360 

ACTCCAATTG GCGATGGCCC TGTCCTTTTA 3420 

GCCCTTTCCA AAGATCCCAA CGAAAAGAGA 3480 

GCTGCTGGGA TTACACATGC CATGGATGAA 3540 

GTTTATGCAT CCCTTAACTT ACTTATTAAA 3600 

TTGTTCAAAG C7AATATTGT TTAAATCGTC 3660 

TTGATTTTTT GTAAATATTT TCTTGTATTC 3720 

ATACTTTTGT TTATCTTTGT GTGATATTCT 3780 

GCTATTCACT TTAGGTTTAC GATGAAAATA 3840 

ATCAAC7TCT GCCATTAAAA GTAATGCCAA 3900 

AAACCCGTAT TCCACGATTA AATAAATCTC 3960 

TATTATATCC GTACTTATGT TATAAGGTAT 4020 

AGGAAATTTA AACTGCAATA TATCCTTGTT 4080 
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TAAAACTTGG AAATTATCGT GATCAACAAG TTTATTTTCT GTAGTTTTGC ATAATTTATG 4140 

GTCTATTTCA ATGGCAGTTA CGAAATTACA CCTCTTTACT AATTCAAGGG TAAAATGGCC 4200 

TTXTCCTGAG CCGATTTCAA AGATATTATC ATGTTCATTT AATCTTATAT TTGTCATTAT 4260 

TTTATCTATA TTATGTTTTG AAGTAATAAA GTTTTGACTG TGTTTTATAT TTTTCTCGTT 4320 

CATTATAACC CTCTTTAATT TGGTTATATG AATTTTGCTT ATTAACGATT CATTATAACC 4380 

ACTTATTTTT TGTTTGGTTG ATAATGAACT GTGCTGATTA CAAAAATACT AAAAATGCCC 4440 

ATATTTTTTC CTCCTTATAA AATTACTATA ATTA7ACCAC GAGCTCTGAT AAATATGAAC 4 500 

ATGATGAGTG ATCGTTAAAT TTATACTGCA ATCGGATGCG ATTATTGAAT AAAAGATATG 4560 

AGAGATTTAT CTAATTTCTT TTTTCTTGTA AAAAAAGAAA GTTCTTAAAG GTTTTATAGT ■ 4 620 

TTTGGTCGTA GAGCACACGG TTTAACGACT TAATTACGAA GTAAATAAGT CTAGTGTGTT 4 680 

AGACTTTATG AAATCTATAT ACGTTTATAT ATATTTATTA TCCGGAGGTG TAGCATGTCT 4740 

CATTCAATTT TGAGGGTTGC CAGAC7TAAA GGATCAAGTA ATACAAACGG GATACAAAGA 4800 

CATAATCAAA GAGAGAATAA AAACTATAAT AATAAAGACA TAAATCATGA GG AAA CAT AT 4 860 

AAAAATTATG ATTTGATTAA CGCACAAAAT ATAAAGTATA AAGATAAAAT TGATGAAACG 4920 

ATTGATGAGA ATTATTCAGG GAAACGTAAA ATTCGGTCAG ATGCAATTCG ACATGTGGAC 4980 

GGACTGGTTA CAAGTGATAA AGATTTCTTT GATGATTTAA GCGGAGAAGA AATAGAACGA 5040 
TTTTTTAAAG ATAGCTTGGA GTTTCTAGAA AATGAATACG GTAAGGAAAA TATGCTGTAT 5100 
GCGACTGTCC ATCTGGATGA AAGAGTCCCA CATATGCACT TTGGTTTTGT CCCTTTAACA 5160 
GAGGACGGGA GATTGTCTGC AAAAGAACAG TTAGGCAACA AGAAAGACTT TACTCAATTA 5220 
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CAAGATAGAT TTAATGAGTA TGTGAATGAG AAAGGTTATG AACTTGAAAG AGGCACGTCC 5280 
AAAGAGGTTA CAGAACGAGA ACATAAAGCG ATG 5313 
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Claims 

1. A process for in vivo recombination of homologous DNA se- 
quences, comprising the following steps 

5 a) incubating a cell containing at least two DNA struc- 

tures, each comprising a DNA sequence and an origin of 
replication, under conditions which favour integration 
of one of the DNA-structures into one of the others, 
thereby forming a hybrid DNA structure; 
10 b) incubating the cell under conditions which favour cross 

ing out from said hybrid DNA structure, thereby forming 
novel DNA structures, each 'comprising a recombined DNA 
sequence and an origin of replication; 
c) repeating steps a)-b) at least once, 

15 , 

2. A process for in vivo recombination of homologous DNA se- 
quences, comprising the steps 

a) incubating a cell containing at least two DNA struc- 
tures, each comprising a DNA sequence and an origin of 

20 replication, under conditions where i) at least one ori 

gin of replication is non-functional and ii) the cell i 
under a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is hav- 
ing the non-functional origin (i.e. favoring an integra 

25 tion of one of the DNA-structures into one of the oth- 

ers, thereby forming a hybrid DNA structure); 

b) changing of said conditions to conditions, where fewer 
origins of replication are non-functional than in step 
a) (i.e. favoring a crossing out from said hybrid DNA 

30 structure) ; 

c) repeating steps a)-b) at least once. 

3. The process according to claim 1-2, characterized in that 
at le.ast one DNA structure is a chromosome, preferred a bacte- 
35 rial chromosome, more preferred a Bacillus chromosome, espe- 
cially a Bacillus subtilis chromosome. 
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4. The process according to claim 1-3 , characterized in that 
at least one DNA structure is a vector, preferred a bacterial 
vector, more preferred a vector able to replicate in a Bacillus 
cell, especially a Bacillus subtilis cell. 

5 

5. The process according to claim 4, wherein at least two of 
the DNA constructs are two different vectors ♦ 

6. A process for in vivo recombination of homologous DNA se- 
10 quences, comprising the steps 

a) inserting the homologous DNA sequences in at least two 
different vectors, said vectors contain different origins 
of replication; 

b) transferring the vectors into a cell; 

15 c) incubating the cell under conditions where i) at least one 
origin of replication is non-functional and ii) the cell 
is under a selective pressure which only allows growth of 
the cell if it contains the DNA structure which is having 
the non-functional origin (i.e. favoring an integration of 

20 one of the DNA-structures into one of the others, thereby 

forming a hybrid DNA structure) ; 

d) changing of said conditions to conditions, where fewer 
origins of replication are non-functional than in step c) 
(i.e. favoring a crossing out from said hybrid DNA struc- 

25 ture) ; 

e) repeating steps c)-d) as many times as wanted, however at 
least once. 

7. The process according to claim 1-6, characterised in that 
30 at least one origin of replication is selected in such a way 

that the replication rate of the vector can be regulated chemi- 
cally or physically. 

8. .The process according to claim 7, characterised in that at 
35 least one origin of replication is temperature sensitive. 
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9 The process according to claim 8, characterised in that the 
replication rate of at least one vector can be regulated by a 

temperature shift. 

5 io The process according to claim 1-9, characterised in that 
the vectors contain genes for markers, preferred selectable 
markers, especially antibiotic markers. 

11. The process according to claim 10, characterised in that the 
10 vectors contain genes for different markers. 

12. The process according to any. of the preceding claims, 
wherein at least one DNA structure do not comprise ori(-) (i.e. 
only comprise Ori(+)). 

" 13 The process according to claim 12, wherein the DNA structure 
which do not comprise Ori(-) is a DNA structure which origin 
(i.e. herein ori<+>) is functional during the hybrid DNA struc- 
ture formation performed according a process according to any of 

20 the preceding claims. 

14 The process according to any of the preceding claims, 
wherein it is formation of the hybrid structure is measured an 
vivo, preferably by having a system of DNA structures, wherein 
25 an in vivo screenable protein is expressed only after the hybrid 

structure has been formed. 

15. The process according to claim 14, wherein said screenable 
protein is a protein which is fluorescent under suitable expo- 
30 sure, and in particular said fluorescent protein is Green Flo- 
rescent protein (GFP) or variants thereof. 

16 The process according to any of the preceding claims, 
wherein crossing out from the hybrid DNA structure is a forced 
35 crossing out event. 

17. The process according to claim 16, wherein the forced cross- 
ing out from the hybrid structure is performed by: 
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i) constructing a system wherein the DNA hybrid struc- 
ture do not express a selectable marker , and at 
least one of the at least two DNA structures accord- 
ing to any of the preceding claims is able to ex- 
press said selectable marker; 

ii) incubating the cell under selective conditions, 
thereby forcing a cross out of the DNA hybrid struc- 
ture, since the cell is only able to express the 
marker gene after the crossing out of the DNA hybrid 
structure has taken place. 

18. The process according to 17, wherein the selectable marker 
in step i) is expressed on a DNA structure where the origin of 
replication is non-functional during a forced integration of the 
DNA structures into a DNA hybrid structure performed as de- 
scribed above, and preferably the selectable marker gene is an 
antibiotic resistance gene. 

19. The process according to any of the preceding claims, 
wherein the homologous DNA sequences comprise DNA sequences from 
at least one pre-made DNA library. 

20. The process according to any of the preceding claims, 
wherein the homologous DNA sequences are comprised of two or 
more different set of homologous DNA sequences ("set A 11 "set B", 
"setC" etc.), and wherein each of said set of homologous se- 
quences ("set A" "set B" , "setC" etc.) are inserted in a differ- 
ent DNA construct ("DNA construct A", "DNA construct B", "DNA 
construct C" etc..) according to any of the preceding claims. 

21. The process according to claim 20, wherein at least one of 
said sets of homologous DNA sequences is derived from a pre-made 
DNA library, and at least one of said sets of homologous DNA se- 
quences is a DNA sequence which are homologous to said DNA li- 
brary (i.e. resulting in a process for in vivo recombination of 
said DNA library to a DNA sequence which are homologous to said 
DNA library) . 
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22. The process according to 20, wherein said sets of homologous 
DNA sequences is derived from the same pre-made DNA library 
(i.e. resulting in a process for in vivo recombination of said 
DNA library) . 

23. The process according to claim 20, wherein said sets of ho- 
mologous DNA sequences are derived from at least two different 
pre-made DNA libraries (i.e. resulting in a process for in vivo 
recombination of at least two different DNA libraries) . 

24. The process according to claim 20, wherein said sets of ho- 
mologous DNA sequences each only -contain one DNA sequence (i.e. 
recombination of two or more homologous sequences) . 

is 25. The process according to any of the preceding claims, 

wherein two sets of homologous DNA sequences ("set A" and "set 
B") are recombined in vivo, comprising the steps 
a) inserting sequence (s) from "set A" in a vector contain- 

ing a temperature resistant origin of replication and a 
gene for a first marker; 

inserting sequence (s) from "set B" in a vector contain- 
ing a temperature sensitive origin of replication and a 
gene for a second marker; 
transferring the vectors into a cell; 
25 d) incubating the cell at a temperature where both origins 

of replication are functional, optionally under a selec- 
tion pressure that favours cells with both marker genes 
shifting the temperature to a temperature, where only 
the temperature resistant origin of replication is func 

30 tional; 

f) repeating steps d)-e) at least once. 

26. The process according to any of the preceding claims, 
wherein two sets of homologous DNA sequences ("set A" and "set 
35 B") are recombined in vivo, comprising the steps 

a) inserting sequence (s) from "set A" in a vector containing 
temperature resistant origin of replication and a gene for a 
first marker; 



20 

b) 



c) 



e) 
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b) inserting sequence (s) from "set B" in a vector contain- 
ing a temperature sensitive origin of replication and a 
gene for a second marker; 

c) transferring the vectors into a cell; 

d) incubating the cell at a temperature where only the tem- 
perature resistant origin of replication is functional, 
optionally under a selection pressure that favours cells 
with both marker genes; 

e) optionally, shifting the temperature to a temperature, 
where both origins of replication are functional; 

f) repeating steps d)-e) at least once. 



27. The process according to claim 1-26, characterised in that 
the cell containing the DNA structures is defective or transito- 
15 rily inactivated in the enzymatic mismatch repair system or has 
a reduced level of mismatch repair. 



28. The process according to claim 1-27, characterised in that 
the cell is a Gram positive cell, such as a Staphylococcus cell, 

20 a Streptococcus cell, a Bacillus cell, or in particular a Bacil- 
lus subtilis cell. 

29. The process according to claim 1-28, characterised in that 
the cell is a Gran negative cell, preferred an Escherichia cell, 

25 especially an E. coli cell. 

30. The process according to claim 1-29, characterised in that 
the homologous DNA sequences are encoding polypeptides, espe- 
cially enzymes, and in particular enzymes such as an amylase, 

30 lipase, cutinase, cellulase, oxidase, phytase, and a protease. 

31. A method for producing one or more recombinant protein(s) 
having a desired biological activity comprising: 

a) " performing a process for in vivo recombination of 
35 homologous DNA sequences according to any of the preceding 

claims, wherein at least one of the DNA structure (s) is an 
expression vector; 
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expressing the numerous different recombinant polypeptides 
encoded by the numerous different recombined DNA sequences 
from step a) ; and 

screen or select the numerous different recombinant 
proteins from step b) in a suitable screening or selection 
system for one or more recombinant polypeptides (s) having 
a desired activity. 

A set of at least two vectors, where at least two vectors 
are comprising at least two individually different origins of 
replication able to function in the same cell under permissive 
conditions, and two homologous DNA' sequences encoding polypep- 
tides, said DNA sequences having more than 50% homology, prefer- 
able more than 7 0%. 

33. A vector system, comprising at least 

a) a vector A containing a DNA sequence encoding a polypep- 

tide and an origin of replication 

a vector B containing a DNA sequence homologous to the 
DNA sequence in vector A, and an origin of replication 
that differs from the origin of replication in vector A. 



b) 



34. The vector system according to claim 33, characterised in 
that the DNA sequences encodes polypeptides with a biological 
25 activity, especially enzymes. 



35. A ce 



11 containing a vector system according to claim 33-34 



36. The cell according to claim 35, characterised in that is a 
30 Gran positive cell, preferred a Bacillus cell, especially a Ba- 

cillus subtilis cell. 

37. The cell according to claim 35, characterised in that the 
cell. is a Gram negative cell, preferred an Escherichia cell, es- 

35 pecially an E. coli cell. 

38. A recombined DNA sequence, characterised in that it is a 
result of a process according to one or more of the claims 1-30, 
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or it is encoding a recombinant protein having a desired bio- 
logical activity, wherein said recombinant protein having a de- 
sired biological is produced by the method according to claim 
31. 

39. A process for producing a polypeptide, especially an en- 
zyme, comprising i) growing a cell expressing a polypeptide en- 
coded by a recombined DNA sequence according to claim 38, and 
ii) purifying said expressed polypeptide. 

40. A polypeptide, especially an enzyme, characterised in that 
it is produced by a process according to claim 39. 

41. A polypeptide, especially an enzyme, characterised in that 
it is encoded by the recombined DNA sequence according to clain 
38. 
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